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ABSTRACT 

We propose a fresh look at the Main Galaxy Sample of the Sloan Digital Sky 
Survey by packing the galaxies in stellar mass and redshift bins. We show how impor¬ 
tant it is to consider the emission-line equivalent widths, in addition to the commonly 
used emission-line ratios, to properly identify retired galaxies (i.e. galaxies that have 
stopped forming stars and are ionized by their old stellar populations) and not mis¬ 
take them for galaxies with low-level nuclear activity. We find that the proportion 
of star-forming galaxies decreases with decreasing redshift in each mass bin, while 
that of retired galaxies increases. Galaxies with have formed all their 

stars at redshift larger than 0.4. The population of AGN hosts is never dominant 
for galaxy masses larger than We warn about the effects of stacking galaxy 

spectra to discuss galaxy properties. We estimate the lifetimes of active galactic nu¬ 
clei (AGN) relying entirely on demographic arguments — i.e. without any assumption 
on the AGN radiative properties. We find upper-limit lifetimes of about 1-5 Gyr for 
detectable AGN in galaxies with masses between IO^^-IO^^Mq. The lifetimes of the 
AGN-dominated phases are a few 10^ yr. Finally, we compare the star-formation his¬ 
tories of star-forming, AGN and retired galaxies as obtained by the spectral synthesis 
code STARLIGHT. Once the AGN is turned on it inhibits star formation for the next 
~ 0.1 Gyr in galaxies with masses around IO^^Mq, ^ 1 Gyr in galaxies with masses 
around 10^ 

Key words: galaxies: evolution - galaxies: statistics - galaxies: stellar content - 
galaxies : active. 


1 INTRODUCTION 

It is nowadays considered that episodes of nuclear activity 
are of prime importance in the evolution of galaxies and 
in the building up of the present-day Universe, due to the 
strong interplay believed to exist between the active galactic 
nuclei (AGN) and star formation (SF) (see Fabian 2012 for 
a detailed review). Observational evidence of this interplay 
requires a large and complete sample of galaxies with ad¬ 
equate spectra and well defined criteria to classify galaxies 
into appropriate categories. One of the pioneering studies 
in this direction is that of Huchra & Burg (1992), based 
on the CfA redshift survey (Huchra et al. 1983) which is a 
magnitude-limited spectroscopic survey of 2500 galaxies. It 


was found that classical Seyfert galaxies constitute about 1% 
of field galaxies with absolute magnitudes Mb smaller than 
—20, and that the percentage of AGN hosts increases with 
luminosity, reaching 20% for galaxies with Mb below —21 
(using Hq = 100 km s“^). Since then, the Sloan Digital Sur¬ 
vey (SDSS, York et al. 2000) has provided a spectroscopic 
database of nearly one million galaxies which allows one to 
study the properties of galaxies up to a redshift z ~ 0.4. 
Many studies have used SDSS data to address the nature 
of AGN hosts and the relation between nuclear activity and 
star formation (e.g. Kauffmann et al. 2003 - referred to as 
K03 in this paper Schawinski et al. 2007; Lee et al. 2007; 
Deng et al. 2012; Lamassa & Heckman 2013). Some studies 
have complemented optical data from the SDSS with data 
at other wavelengths (e.g. Kauffmann et al. 2007 for the UV; 
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Constantin et al. 2009 for X-rays; Rosario et al. 2013 for the 
mid-infrared). 

Most of these studies adopt the classification used by 
K03 based on the [O iii] A5007/H^ vs [N ii] A6584/Ha 
emission-line ratio diagram introduced by Baldwin et al. 
(1981, commonly referred to as the BPT diagram), in which 
galaxies are divided into star-forming (those below the di- 
visory line introduced by K03), “pure” AGN (those lying 
above the Kewley et al. 2001 - KOI- divisory line), and the 
“transition” or “composite” ones lying in-between. As em¬ 
phasized by Cid Fernandes et al. (2010, 2011) this classifi¬ 
cation leaves aside a large number of galaxies that cannot 
be classified in the BPT diagram because they lack some 
of the required emission lines in their spectra, most often 
H/3. In addition, it does not account for the presence of “re¬ 
tired” galaxies (following the nomenclature introduced by 
Stasinska et al. 2008). These systems have stopped form¬ 
ing stars and are ionized by their old stellar populations 
(namely the hot low-mass evolved stars, or HOLMES), with¬ 
out a detectable contribution of an AGN. Finally, the K03 
line to separate star-forming galaxies from AGN hosts has 
been drawn empirically without physical justification. Con¬ 
sequently, it is uncertain in the region of the BPT diagram 
where the SF and AGN wings merge. Stasinska et al. (2006, 
hereafter S06) showed that many galaxies below the K03 
line do in fact contain an AGN, and proposed a divisory 
line based on photoionization models to separate pure SF 
galaxies from galaxies containing a detectable AGN. It is to 
be noted that the new photoionization models of Dopita et 
al. (2013) are in agreement with the S06 ones as regards the 
identification of pure SF galaxies. 

Cid Fernandes et al. (2011) proposed a different classi¬ 
fication of galaxies which takes into account the existence of 
retired galaxies by considering the equivalent width of the 
Ka line (in their WHAN diagram). In this paper, we show 
that the WHAN diagram leads to a much more compre¬ 
hensive and reliable view of galaxy evolution than the com¬ 
monly used BPT diagram - although it also shares some of 
the drawbacks of the BPT diagram. 

To properly address the problem of the star formation- 
AGN connection, it is important not only to use safe diag¬ 
nostics of the nature of galaxies, but also to divide galaxies 
according to their masses, since it known that mass is a fun¬ 
damental parameter for the pace of galaxy evolution (Cowie 
et al. 1996; Heavens et al. 2004; Cid Fernandes et al. 2007; 
Asari et al. 2007; Jimenez et al. 2007; Haines et al. 2007). 

Another aspect not to be neglected is that past episodes 
of nuclear activity cannot be detected - one can only rec¬ 
ognize those that take place at the time when a galaxy is 
observed. Splitting the observational data in redshift slices 
therefore can provide some clues on the past activity of 
galaxies. Other factors such as morphology or environment 
also play a role in the evolution of galaxies and have been 
addressed by various authors (e.g. Haines et al. 2007; Barn- 
ford et al. 2009; Lietzen et al. 2011; Schawinski et al. 2010; 
2012, 2014), and will need to be reexamined in future pa¬ 
pers in the context of the present work. Here, we focus on 
just the following aspects: the correct assignation of galaxy 
spectral classes implying a proper identification of the re¬ 
tired galaxies, and the importance of presenting the results 
of any analysis in bins of mass and redshift. 

Dividing the observational data into redshift bins has 


another important merit. It allows one to compare in each 
bin spectra encompassing regions of similar physical sizes 
and to study aperture effects by comparing bins of different 
redshifts. This is important since the 3" diameter SDSS fiber 
does not cover the entire galaxies, especially at low redshifts. 
Aperture effects are expected to be important particularly 
for spirals, for which the central zones contain an old bulge 
and - possibly - an active nucleus while the outer zones 
correspond to the star-forming disk. A full assessment of 
aperture effects requires integral field spectroscopy of repre¬ 
sentative samples of galaxies. A few preliminary studies have 
been published (Brinchman et al. 2004; Kewley, Jansen & 
Geller 2005) but investigations based on integral field spec¬ 
troscopy are just starting (Gerssen et al. 2012; Mast et al. 
2013; Iglesias-Paramo et al. 2013). As will become evident 
in the course of the paper, for most aspects of our study 
we need not separate aperture affects and evolution in the 
interpretation of our results. 

This paper is organized as follows. In Sect. [2l we de¬ 
fine our master sample of galaxies, explain the preliminary 
treatment applied to their spectra and divide galaxies in 
mass and redshift bins. In Sect. [3] we discuss the BPT and 
WHAN emission line diagnostics including aperture effects 
and define BPT and WHAN subsamples. In Sect.Sl we pro¬ 
vide a detailed census of the galaxy emission-line spectral 
types in our master sample. This allows us to proceed, in 
Sect.O to a direct estimation of AGN lifetimes. In Sect. (6] 
we proceed one step further and analyze the star formation 
histories of the different categories of galaxies in mass and 
redshift bins, in an attempt to better understand the inter¬ 
play between the star-forming, AGN and retired phases of 
galaxy evolution in the past 3-4 Gyr. The main outcomes 
of the this study are summarised in Sect.[71 

Three appendices complete this paper: Appendix!^ dis¬ 
cusses selection effects in the SDSS Main Galaxy sample. 
Appendix [B1 warns about the effects of stacking galaxy spec¬ 
tra to discuss galaxy properties, and Appendix [Cl provides 
some additional material. 

Throughout the paper we consider a ACDM cosmology 
with Ho = 70 km Mpc“^, Qm = 0.30, and Qa = 0.70. 

2 THE DATABASE 
2.1 Sample selection 

We consider the 7th Data Release of the Slo an Digital Sky 
Survey (SDSS/DR7, lAbazaiian et al.l l2009l j which covers 
9380 sq.deg. in the sky and presents fiber-fed spectroscopy 
in the wavelength range 3800-9200 A with mean spectral 
resolution of A/AA ~1800 for nearly one million galaxies. 
Since we are interested in the relative populations of dif¬ 
ferent kinds of galaxies, we need a well-defined subsample. 
We therefore restrict ourselves to the Main Galaxy Sample, 
which is a complete flux-limited sample of galaxies down to 
a magnitude rrir = 17.77 (Strauss et al. 2002). Note that, 
because we work with mass-redshift bins, we do not need 
a volume-limited sample, which would severely restrict the 
usable redshift range. This issue will be further discussed 
below. 

We select galaxies with z-band covering factor larger 
than 20% to reduce aperture effects in the determination 
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of the stellar masses. We further impose a cut in redshift 
{z > 0.002) to garantee that luminosity distances are not 
dominated by peculiar motions (e.g. Ekholm et al. 2001). 
This reduces our sample to a total of 574,473 objects. Addi¬ 
tional restrictions have to be made, depending on the prop¬ 
erties under discussion, and will be explicited below. 

2.2 Analysis of the data 

Our work on the galaxy spectra makes use of the spectral 
synthesis code starlight (Cid Fernandes et al. 2005). This 
is a inverse stellar population synthesis code which decom¬ 
poses the observed spectra in contributions from simple stel¬ 
lar populations (SSPs) of given age and metallicity. The li¬ 
brary of SSPs is based on the Bruzual & Chariot (2003) evo¬ 
lutionary models of galaxies, adopting a Chabrier (2003) ini¬ 
tial mass function (IMF), “Padova 1994” evolutionary tracks 
(Bertelli et al. 1994) and the STELIB spectral library (Le 
Borgne et al. 2003), as explained in detail in Asari et al. 
(2007). Spectral regions containing bad pixels, emission lines 
and the Na D doublet are not considered for the fits. The 
stellar extinction is an outcome of the starlight fitting. It 
is computed using a Cardelli, Clayton & Mathis (1989) ex¬ 
tinction law with Rv = 3.1. The total masses of stars present 
in the galaxies, M*, are obtained after correcting for the frac¬ 
tion of the galaxy z band luminosity outside the SDSS fiber 
as explained in Cid Fernandes et al. (2005). The intensities 
of the emission lines were measured after subtracting the 
modelled stellar spectrum from the observed one. More de¬ 
tails can be found in Mateus et al. (2006), Stasihska et al. 
(2006), Asari et al. (2007) and Cid Fernandes et al. (2010). 
All the data used in this paper can be retrieved from the 
STARLIGHT databas^B- 

We generally did not make any restriction concerning 
the signal-to-noise (S/N) ratio in the continuum since we 
are mainly interested in the stellar masses, which are very 
robust outputs from starlight: as shown in Cid Fernandes 
et al. (2005), the uncertainty in M* is of 0.1 dex for a S/N 
ratio of 5 at 4000 A, and objects with smaller S/N represent 
only 1.5 per thousand of the objects in our sample (3 per 
thousand of the ones with redshift z > 0.2). Concerning 
the line intensities, we may have to restrict the sample by 
imposing criteria related to the quality of emission lines, as 
detailed below. 

2.3 Mass and redshift bins 

In order to convey a synthetic view of the realm of SDSS 
galaxies we divide our samples of galaxies in bins of mass 
and redshift. We consider mass bins of 0.5 dex for log mJ 3 
between 8.5 and 12.5. Concerning redshift, we consider bins 
of Az = 0.05 until z = 0.40. These redshift limits correspond 
to sampled galactic radii of 1.5 to 8.1 kpc for the adopted 
cosmology. The widths of the adopted mass-redshift bins are 
a compromise between i) obtaining a significant number of 
objects in each bin to robustly define mean properties and ii) 
defining a sufficient number of bins in the mass and redshift 
ranges considered. Note that the errors in the individual 

^ http://casjobs.starlight.ufsc.br/casjobs/ 
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masses of the galaxies are much smaller than the width of 
the mass bins. 


3 EMISSION-LINE DIAGNOSTICS OF THE 
GALAXIES 

3.1 Some generalities 

As mentioned in the introduction, the vast majority of 
emission-line diagnostics of the ionization of galaxies are 
based on the BPT diagram where pure SF galaxies are con¬ 
sidered to lie below the K03 line and pure AGN above the 
KOI line, the space between the two lines being occupied by 
so-called composite objects. Nevertheless, signs of AGN ac¬ 
tivity can be detected well below the K03 line, as shown by 
S06. On the other hand, “pure” AGNs lie well above the KOI 
line. Note that Kewley et al. (2013) now define another line 
to separate star-forming galaxies from galaxies containing 
an AGN, which actually is very close to the one of S06. 

As shown by Stasihska et al. (2008), the BPT diagram 
is however not able to distinguish between galaxies contain¬ 
ing a weak AGN and retired galaxies, i.e. galaxies ionized 
by their HOLMES (which can also explain the ionization of 
the extraplanar gas in spiral galaxies, see Flores-Fajardo et 
al. 2011). The existence of such galaxies has since then been 
evidenced by integral-field spectroscopy of galaxies, showing 
that the extended emission seen in low-ionization galaxies is 
inconsistent with a central point source for the ionization, 
ruling out nuclear activity as the dominant source for the 
emission (Sarzi et al. 2010; Kehrig et al. 2012; Papaderos 
et al. 2013; Singh et al. 2013). Cid Fernandes et al. (2011) 
proposed to use the equivalent width of Ha, EW(Ha), as a 
function of [N Iij/Ha (the WHAN diagram) to distinguish 
between galaxies containing an AGN and retired galaxies. 
This is the second diagnostic diagram we will consider here. 
Another advantage of using the WHAN diagram instead of 
the BPT is that it requires only 2 lines, while the BPT 
diagram requires 4 lines. We must note, however, that nei¬ 
ther the BPT diagram nor the WHAN diagram are able to 
identify AGNs in low-metallicity galaxies (S06; Groves et al. 
2006). 

The BPT diagram can also be used to rank the metal- 
licit ies in the star-forming wing, since the [O Hi]/[N ii] ratio 
has been shown since Alloin et al. (1979) to be a metallic- 
ity indicator. The [N Iij/Ha ratio has also been shown to be 
strongly correlated with metallicity (Storchi-Bergmann et al. 
1994; Van Zee et al. 1998) in giant H ii regions, and can thus 
serve to rank metallicities in the SF region of the WHAN 
diagram. In the AGN/retired zone, however, its interpre¬ 
tation is not straightforward since [N Iij/Ha both depends 
on the heating power of the main ionizing source (AGN or 
HOLMES) and on the nitrogen enrichment. 

3.2 Aperture effects on the BPT and WHAN 
diagrams 

An important thing to realize is that both the BPT and 
the WHAN diagrams are sensitive to aperture effects. More 
explicitly, the galaxy classification inferred from these di¬ 
agrams depends on the relative proportions of bulge (or 
spheroid) and disk sampled by the SDSS fiber. 
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case 

tiny coverage 

total 

coverage 

BPT 

WHAN 

BPT 

WHAN 

a 

old bulge -h SF disk 

L 

R 

C/SF 

AGN/SF 

b 

AGN + SF disk 

AGN 

AGN 

C/SF 

AGN/SF 

c 

AGN + SF bulge + SF disk 

C 

AGN 

C 

AGN/SF 

d 

SF bulge + SF disk 

SF 

SF 

SF 

SF 

e 

old bulge A old disk/spheroid 

L 

R 

L 

R 


Table 1. Classification of galaxies using the BPT and WHAN diagrams, for two extreme fiber covering factors. “C” stands for “composite, 
“L” stands for “LINER”, “R” stands for “retired”. 


The contingency table presented in Table [T] schemati¬ 
cally shows the diagnoses that would be obtained from the 
BPT diagram in its classical use (since Kewley et al. 2006) 
and from the WHAN diagram for the five possible galaxy 
configurations and two extreme fiber covering factors. We 
note that in case a, both the BPT and the WHAN diagram 
may interpret the emission-line pattern as being a conse¬ 
quence of the presence of an AGN (since “composite” means 
AGN + star-forming) while the galaxy contains no AGN at 
all! The BPT, in addition, finds three more instances of the 
presence of an AGN (in terms of LINER) while the galaxy 
contains no AGN at all. These are cases a and b for a tiny 
fiber coverage, and case e for a total fiber coverage. 


3.3 The BPT and WHAN subsamples 

From our parent sample, we define two subsamples: a sam¬ 
ple based on the WHAN diagram (sample W), and a sub¬ 
sample based on the BPT diagram (sample B). Sample W 
is the least restrictive of the two. Since within the WHAN 
diagram there is a continuity between emission-line galax¬ 
ies and lineless galaxies in the sense that lineless galaxies 
would be found at vanishing values of EW(H(a), here we do 
not impose a condition on S/N on the line intensities. This, 
of course, results in a somewhat uncertain classification of 
galaxies with low S/N but should not bias the global picture. 
On the other hand, we remove galaxies where defects in the 
spectra at the wavelengths of Ha and [N ii] do not allow 
even a coarse evaluation of the ratio of their intensities. To 
do this, we do not consider the criterion of Gid Fernandes 
et al. (2010), since we judge it to be too severe. Instead, 
we measure the nebular velocity dispersion (jgas for the Ha 
and [N ii] lines and count the pixels within ilcTgas of the 
peak of each line. We then remove from our sample galaxies 
with more than 25% bad pixels on either emission line. The 
total number of galaxies that were removed with this crite¬ 
rion is 64,802, leaving 509,671 galaxies in sample W. Sample 
B contains 217,391 galaxies, which are selected by applying 
to sample W the requirement that the S/N must be larger 
than 3 for all the four BPT diagnostic lines: [O iii]5007, H/3, 
[N ii]6584 and Ha. 
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Figure 1. BPT diagrams for sample B (i.e., a subset of 
sample W of objects with good S/N in all four BPT lines) 
in mass-redshift bins. The number of objects contained 
in each panel is indicated. The color scale is in logarithm 
of the number of objects per pixel, as indicated in the 
color bar in the inset which shows the entire sample B 
in the BPT diagram. The curves are the S06 delimita¬ 
tions between pure SF galaxies and galaxies containing 
an AGN. 


Figure 2. WHAN diagrams for sample W (i.e., only re¬ 
moving galaxies with bad pixels in [N ii]6594 or Ho) in 
mass-redshift bins. The number of objects contained in 
each panel is indicated. The color scale is in logarithm 
of the number of objects per pixel, as indicated in the 
color bar in the inset which shows the entire sample W 
in the WHAN diagram. The continuous grey lines indi¬ 
cate the delimitations between SF galaxies, galaxies with 
strong and weak AGNs, and the dashed grey line marks 
the limit for retired galaxies according to Cid Fernandes 
et al. (2011). 


3.4 The BPT and WHAN diagrams in mass and 
redshift bins 

Fig. [T]show the BPT diagram for sample B in the (M*, 2 :) 
bins we have defined. The total number of objects in each 
bin is indicated. The curves represent the S06 line to dis¬ 
tinguish pure SF galaxies from AGN hosts. As expected the 
lowest mass bins are populated only for the lowest redshifts 
(because the sample is flux limited), and the highest mass 
bins have very few objects below z = 0.1 (because of the 
decrease of the galaxy mass function towards high masses, 
see e.g. Panter et al. 2004). We clearly see how - at any 
redshift - the right wing becomes more and more populated 
with increasing stellar mass, leaving virtually no object in 
the SF wing at the highest masses. 

Fig. [2] shows the WHAN diagram for sample W in the 
(M*,z) bins. The grey lines indicate the delimitations be¬ 
tween the four categories of emission-line galaxies accord¬ 
ing to Cid Fernandes et al. (2011). In this diagram we also 
see how the SF zone becomes gradually depopulated with 
increasing stellar mass. But here we understand that this 
change essentially benefits the category of retired galaxies 
and not to that of AGN hosts. 


4 A DETAILED CENSUS OF GALAXY 

EMISSION-LINE SPECTRAL TYPES FROM 
THE SDSS DATA 

Cid Fernandes et al. (2010) discussed the proportions of 
pure star-forming galaxies/AGN hosts/retired galaxies for a 
volume-limited sample of SDSS galaxies with emission lines 
without paying special attention to their masses and red- 
shifts. Here, we extend this discussion by evaluating these 
proportions in mass and redshift bins. 

The first thing to consider in such an endeavour is the 
question of completeness. If observational selection removes 
some kind of galaxies from our sample, we must take this into 
account in the overall census. By working in (M*, z) bins, we 
are less prone to discriminations against certain categories 
of objects (defined by their colours or by their emission-line 
properies). In Appendix |A] we discuss in detail the bias ex¬ 
pected in each (M*,z) bin. It turns out that some bins are 
definitely free from colour bias. At the smallest redshifts, the 
bins with log M* >10 are complete. In the next redshift in¬ 
terval, bins with log M* > 10.5 are complete, and so on. 
All the subpanels where the sample is complete are flagged 
with a yellow background in Fig. EH As shown in Appendix 
El adjacent bins are not much affected by colour bias. As 
redshift increases, the limiting magnitude rrir of the Main 
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Galaxy Sample will reduce the number of complete mass 
bins by playing against galaxies of smaller masses. What is 
problematic for our study is that there could be a colour dis¬ 
crimination against some kind of galaxies. A general prop¬ 
erty of a survey limited by magnitude in the r band is that 
the galaxies that are more easily missed are red ones, since 
blue galaxies with same masses are more luminous, as seen 
in Appendix O Thus, at the lowest redshift, the log M* < 9 
bin could be missing an important number of red galaxies. 
However, the selection bias will become less important for 
9 <log M* <9.5, and will probably vanish for log M* >10. 
In the next redshift bin, the selection bias will start being 
noticeable for log M* < 10.5, an so on. 


4.1 BPT-based demography 

Before turning to the WHAN diagnostic diagram, it is in¬ 
structive to first carry out a demographic study using the 
BPT diagram in the way promoted by K03, since this is still 
the most popular way to separate galaxies into star-forming 
and AGN hosts. 

Figure [3] shows the fraction of galaxies of different 
emission-line spectral types for sample B as a function 
of redshift for our different mass bins, as obtained from the 
canonical use of the BPT diagram, i.e. using the K03 and 
KOI lines to distinguish between SF, “composite” and AGN 
galaxies. In each panel we indicate the total number of ob¬ 
jects represented. The larger symbols represent (M*,z) bins 
that are judged free of bias (see Appendix 0, and adja¬ 
cent bins are probably little affected by bias. We only plot 
bins with a minimum of 5 galaxies. We see that for log 
M* < 9 all the galaxies are of SF type; AGN hosts are found 
only for masses larger than that. Until log M* = 10, SF 
types still constitute the dominant population of galaxies. 
However, as noted above, the Main Galaxy Sample may be 
missing a population of low-mass red galaxies, whose impor¬ 
tance is difficult to assess. Among this population, galaxies 
containing an AGN but not experiencing present-day star 
formation could perhaps exist. At higher values of M* the 
tendency starts reversing and for log M* > 11.5 we find 
that the population of galaxies is vastly dominated by AGN 
hosts. We also note that the proportion of SF galaxies al¬ 
ways tends to decrease with decreasing redshift while the 
proportion of AGNs increases. Qualitatively, this is what is 
expected from mere aperture effects where the contribution 
of the old bulge with respect to the star-forming disk in¬ 
creases as redshift decreases. 


4.2 WHAN-based demography 

As commented before, the BPT diagram demands four lines 
to be observed with good S/N, which selects against weak- 
line objects and leaves aside about half of the galaxies of 
our master sample. More importantly, the BPT diagram is 
not able to distinguish between hosts of weak AGN and re¬ 
tired galaxies since both categories display similar line ra¬ 
tios. Emission-line equivalent widths need to be considered 
as well. 

Now we classify the galaxies according to the WHAN 
diagram, which overcomes these limitations of the BPT. Fig¬ 
ure 0] shows the fraction of SF galaxies, AGN hosts (putting 


strong and weak AGN in a single category) and retired 
galaxies (with or without emission lines) as a function of 
z in the different mass bins in our sample, for both samples 
B and W. 

Let us first focus on the differences between Fig. [3] and 
the top panel of Fig. 01 Both sets of plots show galaxies in 
mass and redshifts bins for sample B, the former classifying 
galaxies based on their position on the BPT and the latter 
on the WHAN diagram. For masses 10.5 < logM* < 12, the 
BPT classes tricks us into finding an increase in AGN ac¬ 
tivity at smaller redshifts. When classifying the same set of 
galaxies using the WHAN this trend disappears. By telling 
apart AGN hosts from retired galaxies, the staggering con¬ 
clusion is that a great part of what is attributed to AGN be¬ 
haviour on the BPT is in fact just due to galaxy retirement. 
Given the dispersion of galaxy types within a mass-redshift 
bin, one needs to be extremely careful when working with 
averaged galaxy properties or stacked spectra (see Appendix 
[0, even when binning in stellar mass and redshift. 

The bottom panel of Fig. 0| shows galaxies as classified 
on the WHAN diagram for sample W, which is much more 
inclusive than sample B. 

The general behaviour is that, as redshift decreases, in 
each mass bin the proportion of SF galaxies decreases, the 
proportion of retired galaxies increases while the proportion 
of AGN hosts rather tends to decrease (at least for log M* > 
10.5). As was the case in Fig. [S] we see that in the Main 
Galaxy Sample SF galaxies dominate the whole population 
of galaxies at any redshift for log < 10.5. On the other 
hand, retired galaxies always dominate the whole population 
for log M* > 11.5. In the intermediate mass bins, retired 
galaxies dominate at the lowest redshifts. As was the case 
for the BPT diagram, the redshift behaviour is qualitatively 
what is expected from aperture effects alone. For example, 
at intermediate masses the increasing proportion of retired 
galaxies as redshift decreases is expected as the fibre samples 
decreasing portions of the star-forming disk. This behaviour 
can also be due to galaxy evolution. 

In spite of this uncertainty in the interpretation, in ab¬ 
sence of any colour bias. Fig. 0] would nicely depict the down¬ 
sizing paradigm in the local Universe. The vast majority of 
galaxies with log > 11.5 have formed all their stars at 
redshift larger than 0.4. Galaxies with 10.5 < logM^ < 11.5 
gradually stop forming stars between z = 0.4 and the present 
time. Galaxies with log M* < 10.5 still form stars presently. 
However, as explained in Appendix]^ we expect the Main 
Galaxy Sample to miss red galaxies for masses below a cer¬ 
tain threshold mass to an extent that we are not able to eval¬ 
uate, but which is certainly more important for the lowest 
masses. This means that the redshift evolution of the spec¬ 
tral types of galaxies with masses below 10^°'^M© cannot be 
obtained from the SDSS Main Galaxy Sample (although we 
will see in Sect. [5] that our spectral-type census is probably 
correct down to 10^° M©). 

In the mass and redshift bins that we consider free of 
colour bias, the population of AGN hosts is never predomi¬ 
nant (except at the highest reshift of the 11 < log M* <11.5 
bin where they outnumber the retired galaxies by a tiny 
margin). Note that we reach such a conclusion in spite of 
the fact that our definition of AGN host is less restrictive 
than the one which is generally used (i.e. that of K03). Also 
noteworthy is the fact that we do not see an increase in 
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Figure 3. Fraction of galaxies of different emission-line spectral types as a function of redshift for the different mass bins. The spectral 
types are obtained from the canonical use of the BPT diagram. Blue triangles: SF; green squares: AGN; cyan circles: composite. Large 
symbols correspond to bins that are judged devoid of colour bias (see text and Appendix [Al) . 
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Figure 4. Fraction of galaxies of different emission-line spectral types (blue triangles: SF; green squares : AGN (strong and weak); red 
circles: retired (with or without emission lines) as a function of redshift for the different mass bins as obtained from the WHAN diagram. 
Large symbols correspond to (M*, z) bins that are judged devoid of colour bias. Top: WHAN classification of sample B, which imposes a 
S/N limit to [O iii]5007, H^, [N ii]6584, and Ha. Bottom: WHAN classification of sample W, which only excludes galaxies whose spectra 
around [N ii]6584 or Ha is unreliable. 


the proportion of AGN hosts with decreasing redshift, as 
was the case with the BPT diagram. Indeed, this increase 
is fake, and only due to retired galaxies being mistaken for 
AGN hosts. 


We have found that the standard use of the BPT dia¬ 
gram and our use of the WHAN diagram provide very dif¬ 
ferent panoramas of the demographics of galaxy emission¬ 
line spectral types across the past 4 Gyr (which is the look- 
back time corresponding to z = 0.4). One advantage of the 
WHAN classification is that it does not set aside a large 
number of the galaxies. The other one is that it does not 
attribute to nuclear activity the emission-line ratios seen in 
weak-line objects. In the remaining of the paper, we there¬ 
fore only consider the galaxy spectral classes that we defined 
with the help of the WHAN diagram (i.e. SF, AGN hosts 
and retired) and only analyze sample W. 


5 AGN LIFETIMES 

AGN lifetimes have been estimated by a number of methods 
based on black hole demographics or AGN radiative proper¬ 
ties which led to quite dispersed results (Martini 2004). Part 
of this dispersion may be due to the fact that the different 
studies do not actually refer to the same objects (e.g. high 
or low level of activity, massive or less massive galaxies). 

From the census of emission-line spectral types pre¬ 
sented above, we can estimate the AGN lifetimes in each 
mass bin (and even in each redshift bin) by dividing the 
number of AGN hosts in a given bin by the total number 
of galaxies in that bin and multiplying it by the age of the 
Universe at the corresponding redshift. This is shown in Fig. 
[5] We find lifetimes of the order of 1-5 Gyr for log M* be¬ 
tween 10 and 12. Statistics for lower masses are affected by 
the colour bias discussed in Appendix so the lifetimes are 
not reliable: since we probably miss retired/red galaxies at a 
higher proportion in the biased bins, the fractions of AGNs 
and SF in such bins are probably upper limits. At higher 
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Figure 5. AGN lifetimes computed from the census of AGN hosts in the WHAN diagram as a function of redshift, for different mass 
bins. Top: considering all galaxies with a detectable AGN; Bottom: considering AG^-dominated galaxies. As in Fig. [3] large symbols 
correspond to bins that are judged devoid of colour bias. 


redshift, the AGN population is also contaminated by bulge 
+ disk systems in disguise (case a of Table [T]) . This means 
that the AGN lifetimes we show are also upper limits in 
these cases. In the 12-12.5 mass range, the lifetimes appear 
to be shorter, about 0.3 Gyr. We can do the same, now re¬ 
stricting ourselves to those objects where the AGN heavily 
contributes to the emission lines (i.e. galaxies which have 
EW(H(a) > SA and are above the KOI line; for simplicity 
we will refer to them AGlSi- dominated galaxies, although this 
term is somewhat inaccurate since stellar ionization might 
be preponderant in a number of cases). This is done in the 
bottom panel of Fig. O Now the lifetime appears to be of 
the order of a few 10® yr. This implies that strong AGN are 
short-lived with respect to weak AGN, or that the phase of 
strong nuclear activity is shorter by a factor of 3-10 than 
the period where nuclear activity can be detected. 

Note that for 10 < log M* < 12 we do not see any sig¬ 
nificant variation of AGN lifetimes with respect to redshift 
even in the redshift domain where absence of colour bias is 
not guaranteed. This can perhaps be taken as an indication 
that the colour bias is not very strong for those bins. Nei¬ 
ther do we see any strong variation of the estimated AGN 
lifetimes with respect to M*. 

It must be noted that the obtained lifetimes are strongly 
related to the criterion used to define the activity, as indeed 
illustrated by the top and bottom panels of Fig.[5l and care 
must be taken when they are used in other contexts. 


6 STAR FORMATION HISTORIES 

We have just seen in the previous section that galaxies host¬ 
ing AGN are, on average, more massive than SF galaxie^. 

^ This was already found by K03, even though their AGN sam¬ 
ple was strongly contaminated by retired galaxies, which are on 
average more massive than SF ones. 


It is also well-known that “red and dead” galaxies are more 
massive than galaxies forming stars presently (Heavens et al. 
2004). However, at a given mass and redshift, some galaxies 
have already stopped forming stars while others still form 
them, some galaxies host an AGN while others do not. It 
has repeatedly been shown by detailed studies of nearby ob¬ 
jects that nuclear activity in Seyfert galaxies is linked with 
some level of star-forming activity (Shlosman 1990; Storchi- 
Bergman et al. 1996; Raiman & Storchi Bergmann 2000; 
Veilleux 2001; Gonzalez Delgado et al. 2009). In order to 
further discuss the relation between star formation and nu¬ 
clear activity, here we study the star-format ion histories of 
the various categories of galaxies considered. 


6.1 Determination of the star-formation histories 

The star-format ion histories can be obtained from the 
STARLIGHT analysis of the galaxy spectra, which provides a 
description of galaxies content in terms of simple stellar pop¬ 
ulations of different ages. The specific star-format ion rate, 
SSFR(t), which measures the mass converted into stars at 
time t with respect to the total mass converted into stars, 
is computed for each galaxy by smoothing the stellar pop¬ 
ulation age distribution, as explained in Asari et al. (2007, 
their Eq. 6) with the following modification. For a galaxy at 
a given redshift, part of the light (hence stellar mass) may be 
attributed by starlight to stellar populations older than 
the age of the Universe at that redshift. In order to cor¬ 
rect for that, we find the lookback time tmax corresponding 
to each galaxy redshift and consider that the SSFR(t) at 
t — tmax is the sum of the contributions of all stellar popu¬ 
lations older than tmax- 

Note that the time resolution of stellar population anal¬ 
yses decreases with lookback time, so that it is not possible, 
for any given galaxy, to detect short episodes of star for¬ 
mation that happened long ago. Besides, the history of the 
activity in galaxies is not recorded in their emission lines. A 
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statistical way to circumvent these limitations is to follow 
a population of galaxies through different redshifts. This 
allows us not only to detect short episodes of star forma¬ 
tion which are nowadays too old to be discerned, but also 
to directly measure the emission-line properties in this past 
epoch. Our dataset allows us to peep into the starburst- 
AGN connection in the last 4 Gyr of the life of the galaxies 
(with the caveat that we look at increasingly large portions 
of galaxies as redshift increases). 


6.2 Star formation histories of the various galaxy 
spectral types 

Figure [6] shows the variations of the median SSFRs (thick 
blue curves) and 16th and 84th percentiles (thin curves) as 
a function of lookback time in each of our mass-redshift bins 
for our three galaxy spectral types. The top panel concerns 
SF galaxies, the middle panel concerns AGN hosts, and the 
bottom panel concerns retired galaxies. Only bins containing 
at least 50 objects are shown, to allow for a robust represen¬ 
tation of the star-format ion history in the considered bins. 
For convenience, in each (M^,z) subpanel, we consider the 
lookback time with respect to the galaxy redshift, and not a 
cosmological lookback time taking into account the redshift 
of the galaxy. The inset indicates the scale on the (log SSFR, 
log t) plane used to represent the star-format ion histories in 
each bin. In each panel, the cyan background points rep¬ 
resent all the galaxies from our sample that belong to the 
same spectral class. 

We now turn to the discussion of the star-format ion his¬ 
tories. Goncerning SF galaxies, the first thing to notice from 
Fig.[6]is that, at any redshift, the SSFR changes from slightly 
decreasing with time for the highest masses (log M* > 11.5) 
through roughly flat for intermediate masses to increasing 
with time for the smaller masses. We also see that, for a 
given mass bin, the ratio of recent to past star-format ion 
rate increases steadily with z for all mass bins. There is, 
of course, a certain dispersion in all those relations, as seen 
from the curves indicating the percentiles, but the described 
tendencies are unambiguous. 

To interpret the observed trends, we first must ask 
whether, in a given mass bin, the galaxies in the panels with 
higher values of z are the predecessors of the galaxies with 
lower z. This is not necessarily true, because of mass growth, 
which, in the (log M*, z) plane, tends to move the galaxies 
towards the right as redshift decreases. This effect, however, 
is in general not crucial for the galaxies of our sample, given 
the width of our redshift bins. Indeed, for a typical SSFR of 
10“^° yr“^ as suggested by the top panel of Fig.[6l the mass 
of stars in the galaxy doubles during a time interval of 10 
Gyr, while the width of our mass bins is 0.5 dex. There are 
episodes where the SSFR is significantly higher than 10“^° 
yr“^, but, as seen in Fig. [6l they are short, so they do not 
contribute much to the stellar mass growth. Thus, roughly, 
in a given M* bin, galaxies at higher redshifts could be con¬ 
sidered as the progenitors of galaxies seen at lowest redshift 
(but, of course, they may change class during their evolution, 
for example an episodic star formation may take place or an 
AGN may appear). In a given mass range, aperture effects 
qualitatively explain the observed tendencies of the SSFR 
with redshift, since as redshift increases increasing portions 
of the star forming disk is covered by the SDSS fiber. In ad- 


0.3 


_.h -9 

10 
11 


0.2 - 


0.1 


T" 


-I—I—I—I—I—I—I—I—I—I—I—I— 

115551 SF 


m 

M -12 

o 


pS -10 E- SFH 

[x. 


7 8 9 10 
log t [yr] 
















J L 




—I I I I I I L 


10 11 
log [Mg] 


12 



Figure 6. Star-formation histories, i.e. SSFR vs lookback time in 
the galaxies redshift frame in (M*, z) bins. Top: SF galaxies; mid¬ 
dle: AGN hosts; bottom: retired galaxies. Thick curves: median 
SSFR; thin curves: 16th and 84th percentiles. SSFRs are shown 
only for bins with at least 50 objects. The cyan background points 
show the (M^, z) diagram for the relevant subsample of galaxies. 
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dition, we must take into account the existence of the bias 
against red galaxies which, in a given mass bin, likely grows 
with increasing redshift (see Appendix I3i. Thus, the appar¬ 
ent redshift evolution of the SSFR in a given mass bin is 
perhaps simply due to these two causes and not an evolu¬ 
tionary effect. 

The middle panel of Fig.[6]is analogous to the top panel 
but is for AGN hosts. Note that, because of our decision to 
show results only for subpanels populated by at least 50 
galaxies, some panels have disappeared while a few others 
have appeared with respect to the SF case. Compared to the 
star-format ion histories of SF galaxies, those of AGN hosts 
show similar tendencies, with, however, the SSFRs being 
lower at small lookback times than those of SF galaxies in 
the same (M*,z) bin. We will return to this point below. 

The bottom panel of Fig.[6]is analogous to the top panel 
but concerns galaxies appearing as retired instead of star¬ 
forming. Here we witness a very strong decrease of the SSFR 
with time in all the subpanels, with a rather small disper- 
Sioii. 


6.3 Comparison of the star-formation histories of 
the three galaxy spectral classes: a clue to the 
star formation — AGN connection? 

Galaxies that are now appearing as retired must have been 
seen as star-forming in the past, perhaps not continuously, 
but at least episodically. Thus their progenitors - or at least 
some of them - are actually in the top (or middle) panel of 
Fig. El It is therefore interesting to compare in more detail 
the star-format ion histories of our three classes of objects. 

Fig. [7] displays the median star-format ion histories of 
SF galaxies (blue curves), AGN hosts (green) and retired 
galaxies (red) in bins. One can see that, at any red¬ 

shift, the star-format ion histories as obtained by starlight 
for SF galaxies and AGN hosts are roughly similar. The 
largest differences occur at small redshift and small masses. 
Another thing to note is that the SSFRs of retired galaxies 
are identical to those of SF and AGN galaxies at the largest 
lookback times diverging at lookback times of one to a few 
Gyr. 

The curves in Fig. [7] can be read as follows. Judging 
from the point where the green and blue curves split apart 
in the figure, the nuclear activity kick-off in SF galaxies 
reduces the level of star formation during the next 0.1-1 
Gyr with respect to SF galaxies in the same (M*, 2 :) bin. 
In low mass galaxies, M* ~ Mq, once the AGN is 

turned on it inhibits star formation for the next ~ 1 Gyr. 
In high mass galaxies this delay seems shorter: ~ 0.1 Gyr 
for ~ 10^^Mq and perhaps even less for higher masses, 
as can be read from Fig. [10] (top), which shows a close-up 
of Fig. [71 All these AGNs keep forming stars, albeit at a 
reduced rate. 


^ We believe that the upturn at small lookback times seen at the 
smallest redshifts is an artefact due to the incomplete treatment of 
horizontal branch and/or blue stragglers in the BC03 evolutionary 
synthesis models used in the starlight analysis. As previously 
shown by Ocvirk (2010) and Gonzalez Delgado &: Cid Fernandes 
(2010), the lack of such old and hot phases tends to produce fake 
bursts in a full spectral fitting analysis. 



log [M^] 


Figure 7. Star-formation histories in mass-redshift bins for all 
the galaxies. They are represented in blue for SF galaxies, in 
green for AGN hosts, and in red for retired galaxies. 

Another interpretation of Fig. [7| could be as follows. 
The population of AGN hosts could actually be composed 
of retired galaxies and of massive SF galaxies, resulting in 
median star-format ion histories of AGN hosts intermediate 
between SF and retired galaxies. In such a circumstance 
Fig. 13 would not necessarily imply the existence of a causal 
link between the AGN phenomenon and recent star forma¬ 
tion. One would then expect a dichotomy in the distribu¬ 
tion of the star-format ion rates of AGN hosts. Fig. [8] shows 
the distribution of the star-format ion rates at t = 24.5 Myr 
(which Asari et al. 2007 found that best matches the Ha- 
calibrated SFR) for the three galaxy types. The distribution 
for AGN hosts is not bimodal, which rules down this second 
interpretation. 

The overall quenching factor (as guessed from the 
SSFR(AGN)/SSFR(SF) ratio in a given bin) decreases with 
increasing M*. At low M*, AGN hosts have specific star- 
formation rates nearly 10 times lower than SF galaxies, while 
at high M* this difference reduces to a factor of 2 or less. 

Fig. [9] is analogous to Fig. [7] but here we only consider 
AGN hosts that are dominated by the AGN (i.e. galaxies 
which have EW(Ha) > SA and are above the KOI line in the 
BPT diagram.) Qualitatively, the behaviour is the same as 
in Fig. [7] except that the quenching factor is larger (compare 
top and bottom of Fig. [TOl). 


7 SUMMARY 

In this paper, we have addressed anew the topic of the star 
formation - AGI^ connection in the context of global galaxy 
evolution by studying several aspects. 

^ Note that our working definition of an AGN host is that the 
galaxy contains a detectable AGN, but the AGN itself may be 
weak, in particular with respect to the ionization due to massive 
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Figure 8. Distribution of recent star formation rates (at look- 
back time of 25 Myr) in mass-redshift bins for all galaxies. His¬ 
tograms for SF galaxies are in blue, for AGN hosts in green, and 
for retired galaxies in red. 



log M* [M,^] 


Figure 9. Similar to Fig.[7|but here the AGN class contains only 
galaxies which have EW(Hq;) > SA and are above the Kewley 
(2001) line in the BPT diagram. They are represented with the 
gold curve. 


• We have paid special attention to retired galaxies, these 
AGN impostors which plague the AGN literature while they 
are merely galaxies that stopped forming stars, commonly 
mistaken for LINERs even in recent studies. 


stars in the vicinity of the nucleus. Gomparison with other studies 
must keep this subtlety in mind. 



Figure 10. A close-up of the lower part of Fig. [7| (top) and Fig. 
[ 9 ] (bottom). 


• We have used a fine mesh of stellar-mass bins to com¬ 
pare the properties of star-forming, AGN hosts and retired 
galaxies, since it is known that the stellar mass is a deter¬ 
minant parameter for galaxy evolution. Studies that divide 
galaxies only with respect to their spectral types are prone 
to misinterpretations due to the fact that the mass distribu¬ 
tions of the three spectral classes considered are different. 

• We also used a fine mesh of redshift bins, which allows 
us to look into evolutionary and aperture effects - although 
it remains difficult to disentangle them. 

We have thoroughly investigated the colour biases in 
the SDSS-DR7 Main Galaxy Sample (Appendix lAl): Due to 
its TTir-limited nature, the sample discriminates against red 
galaxies for decreasing galaxy masses and increasing red- 
shifts. At the smallest reshifts, the Main Galaxy Sample 
misses red galaxies with masses below 10^°M© in propor¬ 
tions that are not possible to evaluate. 

We have shown that the census of emission-line spec¬ 
tral types conveys very different interpretations of galaxy 
evolution when using the BPT diagram or when using the 
WHAN. The WHAN diagram has two important advantages 
over the BPT: i) it does not set aside a large number of the 
galaxies since the requirements for a galaxy to be plotted 
in this diagram are much smaller than for the BPT; ii) 
it does not attribute to nuclear activity the emission-line 
ratios seen in weak-line objects powered by HOLMES, fn 
the remaining of our study, we therefore only considered the 
three classes of galaxies as defined by the WHAN diagram: 
pure star-forming (SF), containing a detectable AGN, and 
retired. 

In the mass range where the Main Galaxy Sample is free 
of colour bias, we find that in each mass bin the proportion 
of SF galaxies decreases with decreasing redshift while that 
of retired galaxies increases. Such a trend is expected in evo¬ 
lutionary terms (like people, galaxies evolve to retirement), 
but this in not an unambiguous interpretation given that 
aperture effects work in the same sense. The vast majority 
of galaxies with M* > 10^have formed all their stars 
at redshift larger than 0.4. 

We find that the population of AGN hosts is never dom¬ 
inant for galaxy masses larger than 10^°M©. Note that we 
reach such a conclusion in spite of our definition of AGN 
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host being considerably less restrictive than the one gener¬ 
ally used, since we consider as AGN hosts all the galaxies 
that contain a detectable AGN, and not only galaxies whose 
ionization is dominated by an AGN. 

From the census of emission-line spectral types we have 
been able to infer the AGN lifetimes without any assumption 
on the physics of accretion on the massive black hole nor on 
the radiation properties of the AGN. For detectable AGNs 
we find a lifetime of < 1 — 5 Gyr for galaxies with masses 
between 10^° and 10^^M©, and ^ 3 x 10® yr for masses 
above 10^^M©. The lifetimes of the AGN- dominated phases 
(i.e. corresponding to galaxies which have EW(H(a) > SA 
and are above the KOI line) are shorter: a few 10® yr. Of 
course, the lifetimes obtained in such a way are dependent 
on the criterion used to define the activity - and should be 
used only in similar contexts. 

A deeper and independent insight into the mani¬ 
fold of galaxies is provided by the studies of their star- 
formation histories, as obtained by the spectral synthesis 
code STARLIGHT. In a star-forming galaxy, the onset of nu¬ 
clear activity reduces the star-format ion rate. Once the AGN 
is turned on it inhibits star formation for the next ~ 0.1 Gyr 
in galaxies with M* ~ 10^°M©, and ~ 1 Gyr in galaxies 
with M* ~ 10^^ M0. The overall quenching factor is lower 
for higher values of M*. In AGN-dominated galaxies, as op¬ 
posed to galaxies with a detectable AGN, the AGN phases 
are shorter: a few 10® yr. At low M*, the quenching factor is 
larger. Note that the values for the AGN lifetimes obtained 
from demographic arguments and from star-format ion his¬ 
tory considerations agree very well with each other. 

Of course, none of this tells us anything about the phys¬ 
ical conditions which lead a galaxy to ignite its supermas- 
sive black hole, nor how exactly this inhibits star forma¬ 
tion. Internal (bars and other angular momentum related 
physics) and/or external (interactions and environment) fac¬ 
tors should play a role. 
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APPENDIX A: ABOUT SELECTION EFFECTS 
IN THE SDSS MAIN GALAXY SAMPLE 

Here we analyze whether in our sample there is a selection 
effect for or against certain galaxy colours due to the fact 
that the Main Galaxy Sample (MGS) of the SDSS is lim¬ 
ited in r apparent magnitude to rrir >17. For that, in a 
first approximation we consider a toy-model (model Ml) in 
which a galaxy is composed of a very young stellar popula¬ 
tion of 1 Myr and of an old stellar population of 13 Gyr and 
we vary the proportion in mass of those two stellar popu¬ 
lations. The SSP models are solar metallicity models taken 
from Bruzual V Chariot (2003). For each proportion of the 
young and old stellar populations, we compute the resulting 
colour u — r, applying the response curves of the SDSS u and 
r filters. We then compute the absolute magnitude in the r 
band corresponding to a given value of M*, and the appar¬ 
ent magnitude rrir corresponding to a given redshift z. The 
top panel of Fig. IaT] shows the colour-magnitude diagram 
for the models Ml inside each of our mass-redshift bins. For 
each subpanel, the red curves represent sequences of models 
for the central values of and z and for the formal extreme 
values for the considered bin, hence we have five curves in¬ 
side each subpanel (the models are shown only for bins con¬ 
taining at least one object from our WHAN subsample). The 
SDSS Main Galaxy Sample limit {rrir — 17.77) is marked by 
the blue horizontal lines. In each subpanel the green points 
show the corresponding observed colour-magnitude diagram 
(the effect of extinction correction is minimal on the overall 
diagram). 

The first thing to notice in Fig. lAlf topi is that the red 
curves reach a plateau as soon as u — r > 1 mag, which effec¬ 
tively means that, for the colours seen in the our sample, rrir 
is in practice a measure of a galaxy’s stellar mass. We also 
note that in a certain number of bins, all the red curves are 
above the blue line, meaning that the sample is complete for 

^ The Main Galaxy Sample also imposes a cut on the half-light 
surface brightness (/rso) of 24.5 mag arcsec“^. After applying the 
rrir criterion, only 1% of the objects classified as galaxies in the 
SDSS fall into the low surface brightness region (23.0 < < 

24.5; Strauss et al. 2002), so this has virtually no impact on our 
study. 


14 


Stasinska et al. 



9 10 11 12 

log M, [MJ 


0.3 


0.2 


0.1 


0 


Figure Al. Colour-magnitude diagrams of galaxy toy-models in 
bins of mass and redshift. Top: models Ml; bottom: models M2. 
The inset indicates the magnitude scales. The blue horizontal line 
indicates the limiting magnitude nir = 17.77 of the Main Galaxy 
Sample. In each subpanel, the corresponding number of WHAN 
sample galaxies is indicated and their colour-magnitude diagram 
is plotted in green. 
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those bins even 


taking into account the dust attenuatior0. 


This is the case for log M* > 10 in the smallest redshift bin, 
for log > 10.5 for z < 0.1, for log M* > 11 for z < 0.15 
etc. In addition, for those bins where the sample is not com¬ 
plete, i.e. the bins for which the domain of the red curves 
is crossed by the blue line, the slope of the red curves is 
zero. This means that, even in bins where the sample is not 
complete, there is no bias against any particular colour. One 
should however be cautious with few bins where the sample 
is missing a large fraction of objects, i.e. log M* < 9.5 for 
z > 0.05, log < 10 for z > 0.1, log M* < 10 for z > 0.15, 
log M* < 11 for z > 0.2. 

However, our toy model combining one very old (13 
Gyr) and one very young (1 Myr) stellar population is not 
the most appropriate to represent the majority of old galax¬ 
ies. We therefore computed a second toy model (model M2) 
in which the SSP to be combined to the 13 Gyr one has an 
age of 2.5 Gyr. The results are shown in the bottom panel of 
Fig. HU Of course, in this case, all the models are red and 
do not represent the bulk of the Main Galaxy Sample. But 
they show that the Main Galaxy Sample certainly misses 
red objects at low values of M* for the smallest redshifts, 
with increasing limits on M* for increasing redshifts. 

It is unfortunately impossible to quantify in each 
(M*,z) bin the bias against red galaxies that is present in 
the Main Galaxy Sample due to the limitation in rur- What 
we can do is to compare our toy models with the obser¬ 
vations in Fig. lAir bottomj to judge which are the (M*,2:) 
bins which are definitely complete (these are indicated the 
yellow background in the figur^ll). Adjacent bins are prob¬ 
ably only slightly incomplete in red objects, while the bias 
may become non negligible as M* decreases. For example, 
we could be missing a large amount of red galaxies with 
M* < 10^ M©. 


APPENDIX B: A WARNING ON STACKING 
GALAXY SPECTRA 

In the main body of this paper we have studied the disper¬ 
sions of galaxies properties within given mass-redshift bins 
and shown that one single bin can be populated by galax¬ 
ies with completely different excitation sources. In this ap¬ 
pendix we warn against misinterpretations from common ap¬ 
proaches using average galaxy properties or spectral stacking 
(Dobos et al. 2012; Vitale et al. 2013; Juneau et al. 2014). 
What follows is a didactic exercise to show what would be 
concluded had we chosen to use these approaches. 

We divide our parent sample in mass and redshift bins 
as in the rest of the paper. The only change is to extend 
the extreme mass bins to include galaxies with log M* < 9 
and log > 12. We then consider each mass-redshift bin 
as a single point to be plotted on the BPT or the WHAN 
diagram. 


® The distribution of the line-of-sight attenuation parameter Ay 
peaks around 0.0 and 0.5 mag for retired and star-forming galax¬ 
ies respectively, while the nebular Ay for the Balmer decrement 
is consistently around 1.0 mag for the emission-line retired and 
varies from 1.0 in the highest-redshift bin to 2.0 mag in the lowest- 
redshift bin, as can be seen in Figs. IC2l and IC3I of Appendix [C] 
This means that the correction for attenuation is of the order of 


1 mag, which is very small compared to the rrir range in our toy 
models. 

^ An additional potential issue is the contribution of the Ha line 
emission to the mr- However, given the colours of the galaxies 
in the considered sample, the effect - as can be judged from the 
computations by Garcia-Vargas et al. (2013) - is minor. In the 
most extreme cases, it would act in favour of the bluest and least 
massive galaxies. 
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Figure Bl. Locations in the BPT and WHAN diagrams of galaxies with redshifts between 0.002 < 2 : < 0.35 grouped in mass-redshift 
bins. Larger circles represent higher redshifts. The different colours represent the different mass bins, and thin dashed lines join points 
in the same mass bin. The typical uncertainty is drawn on the top right of each panel. The red curve on the BPT represents the S06 
delimitation between pure star-forming galaxies and galaxies containing an AGN. The full and dashed black lines on the WHAN are 
the demarcation of different galaxies as in Cid Fernandes et al. (2010). Top panels: locations obtained from the stacked spectra. The 
error bars are the uncertainty in the line fluxes from the gaussian fitting code. Bottom panels: locations obtained from the averages of 
individual line ratios in each bin, considering only galaxies with good spectra. The larger error bar is the standard deviation and the 
smaller one is the standard deviation of the mean. The background density map represents the location of the individual positions of the 
galaxies in our parent sample. 


First we produce representative spectra in each mass- 
redshift bin by stacking all the spectra pertaining to the bin. 
This allows one to provide a synthetic view of the galaxy 
spectra in the space. Stacking is generally used to 

enhance the signal-to-noise ratio of spectra of objects be¬ 
lieved to belong to the same category, and, for example, 
would be needed to obtain accurate star-format ion histories 
with STARLIGHT for galaxies at higher redshifts. This is not 
necessary for SDSS galaxies, since individual SF histories 
have been obtained but it is nevertheless interesting to com¬ 
ment on the biases that stacking may produce. Obviously, 
stacking erases information on individual galaxies and, if, in 
spite of our fine meshing, each bin contains galaxies of differ¬ 
ent types, the stacked spectrum may not correspond to any 
galaxy known. We anticipate that if, as is currently accepted, 
mass is the main driver of galaxy evolution, the continuum 
in the stacked spectrum should represent the stellar content 
in all the concerned galaxies reasonably well. On the other 
hand, the emission lines can be due to a variety of causes, 
most of them occuring on a short time-scale, the most ob¬ 
vious being current star formation or nuclear activity, and 
their intensities may vary significantly within a bin. 

Before stacking, we normalize the spectra to the same 


value of the stellar continuum at a restframe wavelength of 
4010 — 4060A. Since even in a given redshift bin the wave¬ 
length ranges vary according to the redshift of each object, 
we stack the spectra considering only the wavelength range 
common to all the spectra in the bin. Other ways to stack 
spectra would have been to use median stacking or light- 
weighted averages instead of average stacking. Independent 
of the method, the information of the population of galaxies 
within a bin would be erased. 

The top panels of Fig. IBII show the position in the BPT 
and WHAN diagrams of hctive objects created by our stack¬ 
ing procedure. For the BPT we show sample B and for the 
WHAN sample W. Each colour represents a different mass 
bin, following the palette used in Fig.[3]and[31 and the size of 
the symbols correspond to the redshift (size increases with 
increasing redshift). The points are rather dispersed, but 
some general trends are visible. From the BPT diagram one 
might have the impression that galaxy masses determine if a 
galaxy has ongoing star-format ion, has an important AGN 
or is in between. From the WHAN diagram, we might say 
that the retired galaxies dominate the highest masses, the 
AGN hosts dominate intermediate masses and SF galaxies 
dominate low masses at all redshifts studied here. 


























16 


Stasinska et al. 


—1-1-1-1-1-1-1-1 

: 115551 SF 

133775 AGN 
- 260345 Retired 

1-^^ 

i ■ 

1 

Fa 



1 



— tj- 


- 


' 1 





f ■ 

- 

1 


A 




h' : 

1 


L 





- 

- “^1 

—— |t 






- 

1 1 

_1 1 


1 1 

1 ■ 


1 


J _I ' I I I > I ■ I I I ■ I I ■ I_|_Lj_I__L 

9 10 11 12 


log M, [M^] 


A* [mag] 


0 1 

0 1 

0 1 

0 1 

0 1 

0 1 

0 1 

0 1 



11111111 

11111111 

1 

+ ' 

H^n-p-r 

jl " " 1" 

laJYYO AUIN 

- 260345 Retired 

1 


A 



- 


^ , 

T 


A 

A 


_ 

1 

-+ 

+ 


dr 

dz 


- 







- ' 

- 

n 

H- 

r 

A 

A 






A 



H- 



_1_ 

. 

A, 



H- 

l-"\ 1 

' , 



1^ ■ _ L_lZj _I_ L:L_i _ l-Cj _I__ i_ILi _I__L 

9 10 11 12 

log M, [M^] 


Figure Cl. Distribution of stellar masses in each (M*, z) bin. 
The extremities of the horizontal segments (arbitraly vertically 
shifted for clarity) indicates the 16th and 84th percentiles of the 
M* distribution, while the vertical tickmark indicates the median 
value. The different classes of galaxies are represented with the 
following colours. Blue: SF, green: AGN and red: retired. 


Figure C2. Distribution of values of Ay, the extinction obtained 
from the starlight analysis, in each stellar masses in each (M^, 
z) bin. The extremities of the horizontal segments indicates the 
16th and 84th percentiles, while the vertical tickmark indicates 
the median value. The grey-coloured scale at the top of the dia¬ 
gram indicates the scale for Ay in each mass bin. The different 
classes of galaxies are represented with the following colours. Blue: 
SF, green: AGN and red: retired. 


Another way to represent a collection of galaxies in an 
emission-line diagnostic diagram - if the spectra are, like in 
the SDSS, of good enough quality to allow the measurement 
of emission-line intensities for each galaxy individually - is 
to plot for each (M*, z) bin a point whose coordinates are the 
averages of [O iii]/H/3 [N ii]/Ha or EW(Hq:) for all the ob¬ 
jects with sufficiently good quality. This way, we are select¬ 
ing a different sample from the one before and giving more 
weight to emission-line galaxies. The result is shown in the 
bottom panel of Fig. EH The points are far less dispersed, 
but the interpretations of the trends for “fictive” galaxies 
has not changed by much, except for the fact that the two 
highest-mass bins now show the same redshift trends as the 
lower mass bins. 

The puzzling behaviour of average properties seen 
above, either from stacking or from combining individual 
line measures, calls for great care when interpreting those 
results. In the ideal case, one should examine the dispersion 
of properties inside a bin, which is possible with SDSS data. 
For example, certain galaxies do not display emission lines 
at all, yet they enter in the construction of stacked spectra 
but not on the average emission-line properties inside a bin. 
In some mass bins, a fraction of the galaxies can be pure 
star-forming (except perhaps for an old bulge), while others 
may contain an AGN. It is important to study the distribu¬ 
tion of the emission-line properties in a case where - as here 
- this is possible. 

Indeed, when going to higher redshift, one might not be 
able to do this and will have to rely only on stacked spectra. 
So examining the dispersion of properties in cases where it 
is possible is an important way to educate oneself about 
possible misinterpretations of stacked data. 
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Figure C3. Same as Fig. IG2l for the extinction Ay derived from 
the Balmer decrement, when both Ho: and H/3 are observed with 
a signal-to-noise ratio larger than 3. 

APPENDIX C: ADDITIONAL MATERIAL 

To aid the discussion of the star-format ion histories in mass 
and redshift bins, it is instructive to visualize the distribu¬ 
tions of certain properties of the galaxy classes considered, 
in every (M*,z) bin. All the figures below follow the same 
colour scheme: blue for SF galaxies, green for AGN hosts and 
red for retired galaxies. The considered sample is sample W. 

Fven in the case of our very fine M* and z gridding, 
differences may appear in the M* distributions of the differ¬ 
ent classes of galaxies considered, and, for a same class, for 
different values of z. Figure ICT] summarizes the distribution 





































































Retired galaxies and the star formation - AGN connection 17 


[mag] 


15 

15 

15 

15 

15 

15 

15 

15 

: 115551 SF 



1 1 1 1 1 

' ' 1 ' 'i 
A 

1 1 !_!_!_ 

i-rj-i-i-j 

yiaJYYO AUIN 

- 260345 Retired 


1 



-i 

- 


1 






- 

1 




1 


V 

1 




1 


i 

h 


_ 


1 

i 

i 



^ A 


, ,5 

.4 

. 4 


, 



ID _I_ iZl—i ___ ± _i_l_L_I__L 

9 10 11 12 

log M, [M^] 


r values of retired galaxies exceed those of AGN hosts by 
typically 0.2 dex, and that those of the latter exceed those 
of SF galaxies also by about 0.2 dex. 


Figure C4. Distributions of rrir for the various galaxy spectral 
types. 
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Figure C5. Distributions of the u — r colours for the various 
galaxy spectral types. 


of stellar masses in each (M*, z) bin. The extremities of the 
horizontal segments indicates the 16th and 84th percentiles 
of the M* distribution, while the vertical tickmark indicates 
the median value. We see that, although the mass distribu¬ 
tions are not identical in each subpanel, the median values 
for the SF, AGN and retired classes are very close to each 
other. 

Figures [C2] and [C3] show the distribution of the stellar 
extinction obtained from the starlight analysis, and the 
Balmer-line extinction, respectively. The fact that the the 
distributions of Balmer-line extinctions are broad for the 
retired galaxies is actually due to the large uncertainties of 
this parameter for this category of objects. 

Figure [C4] shows the distributions of rrir for the various 
galaxy spectral types. It illustrates the increasing effect of 
the observational cut-off with increasing redshift. 

Figure IG5I shows the distributions of the galaxy u — 
r colours. We see that in each mass-redshift bin, the u — 































